NCBI Sequence Viewer 



Page 1 of 3 



Search) Nucleotide Igg for 

Limits 




PMC Taxono?^y Bo« 

Details 



default 



Preview/index History Clipboard 

H[ Show: 1 20 y f |p|r^py| | File ;&f f :ffi^t Su^eP^<^P;|I.i# 



□ 1: Y13436 . Homo sapiens soxl...[gi:4128158] 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REMARK 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



gene 
CDS 



4091 bp 



DNA 



linear PRI 09-FEB-2001 



HSSOX1 
Homo sapiens soxl gene. 
Y13436 

Y13436.1 GI:4128158 

SOX1 gene; Sry-related Box 1 protein. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

1 

Malas, S., Duthie,S.M., Mohri , F . , Lovell -Badge, R . and Episkopou, V . 

Cloning and mapping of the human SOX1 : a highly conserved gene 

expressed in the developing brain 

Mamm. Genome 8 (11), 866-868 (1997) 

98051911 

9337405 

2 

Malas, S . 

Direct Submission 

Submitted (28 -MAY- 1997) S. Malas, MRC Clinical Sciences Centre, 

Mouse Embryology, Du Cane Rd, London, W12 ONN, London, UK 

Revised by [3] 

3 (bases 1 to 4091) 

Malas, S . 

Direct Submission 

Submitted ( 06 -JAN- 1999 ) S. Malas, MRC Clinical Sciences Centre, 
Mouse Embryology, Du Cane Rd, London, W12 ONN, London, UK 
On Jan 8, 1999 this sequence version replaced gi : 2230882 . 
Related sequences: AI279621, AI298071, AI215744, AA960996, D81624, 
R24723, R20579, T07302, R14439, AA961095, T06325, R46080. 

Location/Qualifiers 

1. .4091 

/organism="Homo sapiens" 

/mol_type=" genomic DNA" 

/db_xref = 11 1 axon: 96 06" 

/chromosome=" 13 " 

/map="q33-34" 

/clone="pSxBgl . 1" 

61 . . 1224 

/gene="SOXl" 

61 . . 1224 

/gene="SOXl" 

/codon_start=l 

/product="Sry-related Box 1 protein" 

/protein_id= " CAA73847.1 " 

/db_xref="GI : 2230883" 

/ db_xr e f = " GOA : 0 0 0 5 7 0 " 

/ db_xr e f = " SWISS- PROT : 0 0 0 5 7 0 " 

/translation^" MYSMMMETDLHS PGGAQAPTNLSG PAGAGGGGGGGGGGGGGGGA 



http://www.ncbi.nlm. nih.gov/entrez/viewer.fcgi?db=nucleotide&val=4128158 



2/5/04 



NCBI Sequence Viewer 



Page 2 of 3 



KANQDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSE I S KRLGAEWKVMSEAEKRPF I D 
EAKRLRALHMKEHPDYKYRPRRKTKTLLKKDKYSLAGGLLAAGAGGGGAAVAMGVGVG 
VGAA PVGQRLE S PGGAAGGAYAHVNGWANGAY PG S VAAAAAAAAMMQEAQLAYGQH PG 
AGGAHPHRTPAHPHPHHPHAHPHNPQPMHRYDMGALQYSPISNSQGYMSASPSGYGGL 
PYGAAAAAAAAHQNSAVAAAAAAAAASSGALGALGSLVKSEPSGSPPAPAHSRAPCPG 
DLREMISMYLPAGEGGDPAAAAAAAAQSRLHSLPQHYQGAGAGVNGTVPLTHI" 

ORIGIN 

1 ccggccgtct atgctccagg ccctctcctc gcggtgccgg tgaacccgcc agccgccccg 

61 atgtacagca tgatgatgga gaccgacctg cactcgcccg gcggcgccca ggcccccacg 

121 aacctctcgg gccccgccgg ggcgggcggc ggcgggggcg gaggcggggg cggcggcggc 

181 ggcgggggcg ccaaggccaa ccaggaccgg gtcaaacggc ccatgaacgc cttcatggtg 

241 tggtcccgcg ggcagcggcg caagatggcc caggagaacc ccaagatgca caactcggag 

301 atcagcaagc gcctgggggc cgagtggaag gtcatgtccg aggccgagaa gcggccgttc 

361 atcgacgagg ccaagcggct gcgcgcgctg cacatgaagg agcacccgga ttacaagtac 

421 cggccgcgcc gcaagaccaa gacgctgctc aagaaggaca agtactcgct ggccggcggg 

481 ctcctggcgg ccggcgcggg tggcggcggc gcggctgtgg ccatgggcgt gggcgtgggc 

541 gtgggcgcgg cgcccgtggg ccagcgcctg gagagcccag gcggcgcggc gggcggcgcg 

601 tacgcgcacg tcaacggctg ggccaacggc gcctaccccg gctcggtggc ggccgcggcg 

661 gccgccgcgg ccatgatgca ggaggcgcag ctggcctacg ggcagcaccc cggcgcgggc 

721 ggcgcgcacc cgcaccgcac cccggcgcac ccgcacccgc accacccgca cgcgcacccg 

781 cacaacccgc agcccatgca ccgctacgac atgggcgcgc tgcagtacag ccccatctcc 

841 aactcgcagg gctacatgag cgcgtcgccc tcgggctacg gcggcctccc ctacggcgcc 

901 gcggccgccg ccgccgccgc gcaccagaac tcggccgtgg cggcggcggc ggcggcggcg 

961 gccgcgtcgt cgggcgccct gggcgcgctg ggctctctgg tgaagtcgga gcccagcggc 

1021 agcccgcccg ccccagcgca ctcgcgggcg ccgtgccccg gggacctgcg cgagatgatc 

1081 agcatgtact tgcccgccgg cgaggggggc gacccggcgg cggcagcagc ggccgcggcg 

1141 cagagccggc tgcactcgct gccgcagcac taccagggcg cgggcgcggg cgtgaacggc 

1201 acggtgcccc tgacgcacat ctagcgcctt cgggacgccg gggactctgc ggcggcgacc 

1261 cacgagctcg cggcccgcgc ccggctcccg ccccgccccg gcgcggcgtg gcttttgtat 

1321 cagacgttcc cacattcttg tcaaaaggaa aatactggag acgaacgccg ggtgacgcgt 

1381 gtcccccact caccttcccc ggagaccctg gcgaccgccg ggcgctgaca ccagacttgg 

1441 tttagactga acttcggtgt tttcttgaga cttttgtaca gtatttatca cctacggagg 

1501 aagcggaagc gttttctttg ctcgagggga caaaaaagtc aaaacgaggc gagaggcgaa 

1561 gcccactttt gtataccggc cggcgcgctc actttcctcc gcgttgcttc cggacggcgc 

1621 cgaccgccgg agcccaagtg acgcggagct cgtcgcattt gttataaatg tagtaaggca 

1681 ggtccaagca cttacaagtt ttttgtagtt gttaccgctc ttttgggttg gtttgttaat 

1741 ttatacaaag agattaccac caccaccccc tccttcagac ggcggagtta tattctgggt 

1801 tttgtaaaac tttatgtatc tgagcatttc catttttttt tttgggtttt gtattatttc 

1861 ttgtaaatgc attgtgaaaa attttatttt cggcgttgca atgcggggag gagaagtcag 

1921 attatgtaca tagttttcta aaaagccttt cttctaaaaa cgaaaaaaga cccccaccca 

1981 aaatgtttcg agtcaacaaa tttaagagac agagcccatt ttctccataa atttgtaaca 

2041 tgcctatttt tatgtgcatg ttttatgagt tcaaaatgca atgagggaaa tctgacaggg 

2101 aaattatctg tatgaactaa aagtaaggga acccggggaa tgggaggaca ggatttttca 

2161 aggaaccttt ttcaatgaaa gagaaggaag ttaaaaccta taggttattt tgtagagctg 

2221 agtgttaata cgggccgaga aataaaagta tcttctgctc cggctgtttc actgcggacg 

2281 gctggggctg ctgcgcgtta ccttgctgca acngggcgcc ttccacctgg ctgggggtct 

2341 gcgccacagt ttggtccaga ngwgggagga ggaagggaag accccagtgg tgggaccctg 

2401 gaccaggcca tggatgaagg acaaagacca gggcaggtca cgggtttccc aattccccag 

2461 caattaagat ttcgagcaga atttatctaa atgtgtttca aggaaacaca atcgctgaac 

2521 caaaacgtac tgcagccgan ccccctccgt ccatcctctg cccctccccc tggcttcttt 

2581 ctcttgggaa aacgggcaaa ataattgtgc tggattctca cacacacaga aatatcgacc 

2641 atcaccctcc cccgcgtgaa ctgggatgca agttgctaac cgatgtgaac gcaaaatgcc 

2701 ttgttcatta ttcctgacga gatcttgagg ttgtttgatg ctttaaattt tttaattata 

2761 ttattttcta ggtgtttatt ggtacattgc agtttttttt ttgaaattta aaaatttctg 

2821 taaaactttg tcttcaagta atctgacagc attaaatatt gcatttaaaa attatactgt 

2881 agcaaataca tttaaaaatt aatcacaacg ttaagatgaa attatatttt tggaaaaaaa 

2941 aaacacttga agcccagatg gaaatacgtt tatttcagca gccttaggtt tcccctcgct 

3001 ttctcaacac ccttccttgt cctggagtat ggactgtccg tccaaaagtg agcctatgct 

3 061 ataagtttaa tgagaaccga attcagcctg cattcgagaa tagctttaag tataatgctg 

3121 atctgacaat tgacgtgtaa tttgggaagt cattttgata attttgctta aaccactcat 
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3181 tcgttaaagt gattacaaaa aagttcaaga atgatgtcca ctgctttcta acaagataat 

3241 aaaccccccc cctcttttct ttttctttat ttttatttct tttagctatt tgatcctttc 

3301 tgaagcagtt gtttctggaa gagtctgtgc gcccatggat ggctgagcac cactacgact 

3361 tagtccggga taagggcctc cccagtcctc tccgggagat gatttgggaa attttataat 

3421 gcttgttctg ttaactcacc gggaccttga gggtccaatg ggaccttgag ggttttctct 

3481 gaaatataca aacttaaagg actctctctg aggttctttg actgacgtcc actctcagtc 

3541 tggcccctgt gctcccctgt gtgtaccctg gagtttctgt gtccaattgt tggcatctag 

3 601 gtcttggctc aagattagga tgtgggcccc actttagagg cacagactat gaaaagctga 

3661 gttagtgcgc ccgggacgcc aggcaagcag cttttacagt ttggcatctt attgcaggtg 

3721 cttcgtgcac agtcagctga aatagccaat gccaggtgct ccaaccacct tatttccttg 

3781 ttttgttgat tagaacaaca cagaaaaaag caaatataaa tttttaatga ctccatttaa 

3841 aaatatcaca gggtgggggc aaggaaatta gctgagattc atctcaggat tgagattcta 

3 901 tccccccttc cccgccccca gcagtgtcgc tccaattcaa attagtggag aaaagattac 

3 961 agtaggccct gagccgactg tgaattcggt gcttggccaa ggtaacactc atcgtattca 

4021 cggagraaat actatatgat gatagttatt atattatatg acgacttcat tcacttccca 

4081 aatcacaggg t 
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